Predicting Chinese Abbreviations with Minimum Semantic Unit and Global Constraints

نویسندگان

  • Longkai Zhang
  • Li Li
  • Houfeng Wang
  • Xu Sun
چکیده

We propose a new Chinese abbreviation prediction method which can incorporate rich local information while generating the abbreviation globally. Different to previous character tagging methods, we introduce the minimum semantic unit, which is more fine-grained than character but more coarse-grained than word, to capture word level information in the sequence labeling framework. To solve the “character duplication” problem in Chinese abbreviation prediction, we also use a substring tagging strategy to generate local substring tagging candidates. We use an integer linear programming (ILP) formulation with various constraints to globally decode the final abbreviation from the generated candidates. Experiments show that our method outperforms the state-of-the-art systems, without using any extra resource.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unit commitment by a fast and new analytical non-iterative method using IPPD table and “λ-logic” algorithm

Many different methods have been presented to solve unit commitment (UC) problem in literature with different advantages and disadvantages. The need for multiple runs, huge computational burden and time, and poor convergence are some of the disadvantages, where are especially considerable in large scale systems. In this paper, a new analytical and non-iterative method is presented to solve UC p...

متن کامل

Actual impacts of global warming on winter wheat yield in Eastern Himalayas

Himalayas, are among the areas most vulnerable to global warming, however, little is knownabout warming impacts on the crops. Therefore, the actual affects of anticipated warming onwinter wheat were tested in Tibet, China. During the period 1988-2012, Tibet region hasexperienced a large increase in daily mean, minimum and maximum temperatures during wheatgrowing seasons by 0.50, 0.67 and 0.51 o...

متن کامل

The lexical constituency model: some implications of research on Chinese for general theories of reading.

The authors examine the implications of research on Chinese for theories of reading and propose the lexical constituency model as a general framework for word reading across writing systems. Word identities are defined by 3 interlinked constituents (orthographic, phonological, and semantic). The implemented model simulates the time course of graphic, phonological, and semantic priming effects, ...

متن کامل

A New Hybrid Heuristic Technique for Unit Commitment and Generation Scheduling

This paper proposes a novel technique for solving generation scheduling and ramp rate constrained unit commitment. A modified objective function associated with a new start-up cost term is introduced in this paper. The proposed method is used to solve generating scheduling problem satisfying SRR, minimum up and down time as well as ramp rate constraints. Two case studies are conducted to imp...

متن کامل

Modeling feasibility and prediction of minimum and maximum temperature in Iran by bettitt and Holt-Winters methods

Air temperature is one of the most frequently used parameters in the assessment of climate change at global and regional scale. So researchers have tried to modeling and predicting it with different models. This study also aims to model and predict the country's monthly minimum and maximum temperature. Investigates of temporal temperature changes is done by Sen’s estimator and Pettit method and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014